Quality Estimation from Scratch

نویسندگان

  • Julia Kreutzer
  • Artem Sokolov
چکیده

This thesis presents a deep neural network for word-level machine translation quality estimation. The model extends the feedforward multi-layer architecture by [Collobert et al., 2011] to learning continuous space representations for bilingual contexts from scratch. By means of stochastic gradient descent and backpropagation of errors, the model is trained for binary classification of translated words, given only the source sentence and the machine translation. We enhance this model with alignments, and unsupervised pre-training of word representations allows for leveraging large monolingual corpora for supervised quality estimation training. Evaluating it on the data provided by the Workshop on Statistical Machine Translation 2014 and 2015, the model yields competitive results across languages and datasets. A linear combination of the deep model and a shallow linear model trained on baseline features further improves over both individual models. Furthermore, the bilingual word representations learnt during supervised training for quality estimation prove useful for other cross-lingual tasks.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

QUality Estimation from ScraTCH (QUETCH): Deep Learning for Word-level Translation Quality Estimation

This paper describes the system submitted by the University of Heidelberg to the Shared Task on Word-level Quality Estimation at the 2015 Workshop on Statistical Machine Translation. The submitted system combines a continuous space deep neural network, that learns a bilingual feature representation from scratch, with a linear combination of the manually defined baseline features provided by the...

متن کامل

Questing for Quality Estimation A User Study

Post-Editing of Machine Translation (MT) has become a reality in professional translation workflows. In order to optimize the management of projects that use post-editing and avoid underpayments and mistrust from professional translators, effective tools to assess the quality of Machine Translation (MT) systems need to be put in place. One field of study that could address this problem is Machi...

متن کامل

Detection of Line Scratch in Video

Most common defects are flicker, dirt, dust and line scratches.Here we consider line scratch detection.Line scratches appear as thin bright or dark line.This line are usually straight and vertical.The restoration of old videos is based on primary interest because of great quantity of old film records. But manual digital restoration of videos is time consuming process. To detect the scratch in f...

متن کامل

Quality Hound - An online code smell analyzer for scratch programs

In this showpiece, we demonstrate the functionality of Quality Hound — an online program analysis tool that takes as input a Scratch project and presents to the user a visual representation of the detected quality problems. Made accessible via a browser-based user interface, Quality Hound is instantaneously accessible to any Scratch user all over the world. The design of Quality Hound is inform...

متن کامل

Prediction of aqueous solubility from SCRATCH.

This study proposes the SCRATCH model for the aqueous solubility estimation of a compound directly from its structure. The algorithm utilizes predicted melting points and predicted aqueous activity coefficients. It uses two additive, constitutive molecular descriptors (enthalpy of melting and aqueous activity coefficient) and two non-additive molecular descriptors (symmetry and flexibility). Th...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016